Story Link Detection With Entity Resolution
نویسندگان
چکیده
News archives present a vast base of cultural and social knowledge. However, their size is also the cause for difficult navigation through the sequence of articles, belonging to a certain topic thread. In the ideal scenario, one could navigate over the whole sequence of articles, where every article would link to other relevant articles, discussing the same event. Continuing progress in entity resolution and extraction has enabled the possibility to apply semantic background knowledge to the task of story link detection (SLD), adding additional information to existing article text and annotations. In this paper, we propose a method of extracted entity resolution to measure its effect on performance the task of topic link detection. We developed a system which extracts additional entities from article text and links them to entities from our background knowledge base. Current experiments of this ongoing work show that although entity resolution via text similarity outperforms using plain text in the case of story link detection, it only achieves SLD performance comparable to human annotations in some cases.
منابع مشابه
Story Link Detection Based on Event Words
In this paper, we propose an event words based method for story link detection. Different from previous studies, we use time and places to label nouns and named entities, the featured nouns/named entities are called event words. In our approach, a document is represented by five dimensions including nouns/named entities, time featured nouns/named entities, place featured nouns/named entities, t...
متن کاملStory Link Detection based on Dynamic Information Extending
Topic Detection and Tracking refers to automatic techniques for locating topically related materials in streams of data. As the core technology of it, story link detection is to determine whether two stories are about the same topic. To overcome the limitation of the story length and the topic dynamic evolution problem in data streams, this paper presents a method of applying dynamic informatio...
متن کاملThe Effect of Transitive Closure on the Calibration of Logistic Regression for Entity Resolution
This paper describes a series of experiments in using logistic regression machine learning as a method for entity resolution. From these experiments the authors concluded that when a supervised ML algorithm is trained to classify a pair of entity references as linked or not linked pair, the evaluation of the model’s performance should take into account the transitive closure of its pairwise lin...
متن کاملCorpus based coreference resolution for Farsi text
"Coreference resolution" or "finding all expressions that refer to the same entity" in a text, is one of the important requirements in natural language processing. Two words are coreference when both refer to a single entity in the text or the real world. So the main task of coreference resolution systems is to identify terms that refer to a unique entity. A coreference resolution tool could be...
متن کاملStory Link Detection and New Event Detection are Asymmetric
Story link detection has been regarded as a core technology for other Topic Detection and Tracking tasks such as new event detection. In this paper we analyze story link detection and new event detection in a retrieval framework and examine the effect of a number of techniques, including part of speech tagging, new similarity measures, and an expanded stop list, on the performance of the two de...
متن کامل